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ABSTRACT 

Voids are a dominant feature of the low-redshift galaxy distribution. Several recent surveys 
have found evidence for the existence of large-scale structure at high redshifts as well. We 
present analytic estimates of galaxy void sizes at redshifts z ~ 5 — 10 using the excursion set 
formalism. We find that recent narrow -band surveys at z ~ 5 — 6.5 should find voids with char- 
acteristic scales of roughly 20 comoving Mpc and maximum diameters approaching 40 Mpc. 
This is consistent with existing surveys, but a precise comparison is difficult because of the 
relatively small volumes probed so far. At z ~ 7 — 10, we expect characteristic void scales of 
~ 14 — 20 comoving Mpc assuming that all galaxies within dark matter haloes more massive 
than 10 10 M Q are observable. We find that these characteristic scales are similar to the sizes 
of empty regions resulting from purely random fluctuations in the galaxy counts. As a result, 
true large-scale structure will be difficult to observe at z ~ 7 — 10, unless galaxies in haloes 
with masses < 10 9 M Q are visible. Galaxy surveys must be deep and only the largest voids 
will provide meaningful information. Our model provides a convenient picture for estimat- 
ing the "worst-case" effects of cosmic variance on high-redshift galaxy surveys with limited 
volumes. 
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1 INTRODUCTION 

The complex network of filaments and voids observed in the 
present-day Universe is believed to have formed from an ini- 
tially homogeneous distribution of matter. In hierarchical models 
of structure formation, tiny perturbations seeded by the inflation- 
ary epoch grew through gravitational instability, collapsing first on 
smaller scales to form haloes. The subsequent merging and clus- 
tering of smaller haloes resulted in the formation of highly struc- 
tured large-scale systems. Perhaps the most striking characteristic 
of the Universe today is the prevalence of large and nearly spher- 
ical voids in the galax y distribution. The scale s of these voids can 
be enormous. Indeed, Hoyle & Vogele^ J2004I) report characteris- 
tic radii of R ~ 15/i~ 1 Mpc with maximum scales approaching 
R ~ 25/i _1 Mpc in the 2dF Galaxy Redshift Survey. 

The characteristics of voids and the galaxies that populate 
them have been the subject of numer ous theoretical and observa- 
tional studies throu ghout the years dGregory & Thompsonl | l97j ; 



have found hints of large-scale structure at redshifts around z ~ 5 
JShimasaku et alj2003Ll200d : iHu et alj|2004 lOuchi et alj2005h . 

The modelling of voids poses an interesting theoretical prob- 
lem. There have been numerous studies utilising iV-body simu- 
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Hoyle & Vo gelev 20041; IConrov et alj 12005^ To date, these stud 



ies have mostly focused on low redshifts. However, it is clear that 
voids should appear at higher redshifts also. The DEE P2 survey in - 
dicates that voids exist at redshifts of z ~ 1 iConrov et all 2005). 
Surprisingly, a handful of recent Lyman a emitter (LAE) surveys 
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lations jMat his & White 20021: iBenson et al.ll2003 : lGottlober et al.1 
l2003l : IColberg et al.ll2005f) . While these simulations are invaluable 
tools for understanding the details of void dynamics, they are com- 
putationally expensive due to the large volumes and high dynamic 
range required to include a representative sample of voids while 
also resolving the much smaller galaxies that define them. 

Analytic methods provide a useful alternative. Perhaps the 
most promising analyt ic model of void abundances is t he excursion 
set approach taken bv lSheth & van de Wevgaertl ( l2004h . They argue 
that voids actually provide deeper insight into large-scale structure 
than halo formation itself. Their assertion is based on the fact that 
underdense regions generally tend to evolve toward a spherical ge- 
ometry, making the idealisation of spherical expansion more rea- 
sonable. In contrast, gravitationally bound objects typically have 
geometries that are far from spherical. Approximating gravitational 
collapse with the spherical model may be highly inaccurate, which 
partially explains the discrepancies between the Press & Schechter 
( 1974) halo mass function and simula tions ISheth & Tormen 1999; 
ISheth et alj|200ll ; |jenkins et alj200lh . 

A key disadv a ntage to the approach of 
ISheth & van de Weygaertl fc004l) is the difficulty in relating 
their definiti on of voids to observational st udies. As we will dis- 
cuss in ^2.21 Sheth & van d e Weygaertl J2004I) use the dark matter 
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underdensity to define voids. Furlanetto & Piraji| d2006h extend 
their model to define voids in terms of the local galaxy underden- 
sity. They predict characteristic void sizes of R ~ 10/iT 1 Mpc at 
the present day - nearly as large as observed voids. 

In what follows, we present analytic estimates of void size dis- 
tributions at redshifts between z ~ 5 — 10. Our aim is to provide a 
convenient basis of comparison for current and future high-redshift 
observations - presumably, though not limited to, LAE surveys. As 
such, we consider the effects of statistical fluctuations in the galaxy 
counts and the abundance of Ly a emitting galaxies on void obser- 
vations. 

LAE surveys have become an invaluable tool in cosmologi- 
cal studies. In addition to building larger samples at z ~ 5, ob- 
servers have pushed the threshold to redshifts as high as z ~ 
7- 10 dWillis & Courbiri2005l ; ICubv et alj2007l;IStark et alj2007l ; 
lOta et alj|2007ir Surveys at these redshifts could potentially reveal 
important information on the epoch of reionization. Indeed, the ob- 
served clustering pro perties of LAEs could s omed a y be a powerful 
probe of the epoch fFurlanetto et alj|2004 120061 : iMcOuinn et al.l 
2006, 120071: iMesinger & Furlanettoll2007t) . Regions of ionized hy- 
drogen grow quickly around clustered galaxies as reionization pro- 
gresses. When these regions are large enough, Ly a photons are suf- 
ficiently redshifted before they reach neutral hydrogen gas, allow- 
ing them to avoid absorption in the intergalactic medium (IGM). 
Sources within overdense regions are therefore more likely to be 
observed relative to void galaxies, resulting in a large-scale mod- 
ulation of the number density. One method to qua ntify such clus- 
tering is with void statistics, as first attempted by McOui nn et al.l 
( 120071) . This provides comparable power to correlation function 
measurements of the galaxies. However, taking full advantage of 
this technique requires a deeper understanding of voids in the un- 
derlying galaxy distribution; our calculations aim to provide such a 
baseline model. 

The remainder of this paper is organized in the following man- 
ner. In j j2! II we briefly review the basic principle s of the excursion 
set for malism. In section 12.21 we present the Furla netto & Pirar] 
definition of voids in terms of the local galaxy underden- 
sity. Section [2~3l contains the main results of this paper: void size 
distributions at z = 4.86 — 10. In Sj3] we estimate the typical sizes 
of voids that result from random fluctuations in the galaxy distribu- 
tion and develop an alternative definition of voids. In Sj4] we explore 
the assumption that only a certain fraction of galaxies are actually 
visible in LAE surveys. Section [5] contains a rough comparison of 
our calculations to high redshift Ly a surveys. Finally, we offer 
concluding remarks in 

In what follows, we assume a cosmology with parame- 
ters Q m = 0.24, fi A = 0.76, n b = 0.042, H = 
WOh km s" 1 Mpc" 1 (with h = 0.73), n = 0.96, and a s = 0.8, 
consistent with the latest measurements dSpergel et al]|2007t) . All 
distances are reported in comoving units. 



2 VOIDS AT HIGH REDSHIFTS 

2.1 Voids in the excursion set formalism 

In this section, we briefly summa rise the extension of excursion se t 
principles to voids pioneered by Shet h & van de Weyg aert (2004). 
For an excellent review on the excu rsion set formal ism and its many 
applications, we refer the reader to lZentnerl d2007l) . 

The approach we describe here is in many ways similar to 
the e xcursion set formulation of the dark matte r halo mass func- 
tion dPress & Schechterll 1974 [Bond et alj|l99ll) . At a fixed point 



in space, the linear density contrast 8 is smoothed on a scale R. 
We will denote the smoothed version of the linear density con- 
trast with 8 L (R). The variance of the smoothed density contrast 
<y{R) 2 is simultaneously computed for each scale R. The set of 
points [5 L (R), cr(-R) 2 ] define a trajectory parametrized by R in 
the (8 L ,a 2 ) plane. 

It is often convenient to work in coordinates in which the den- 
sity contrast 8 L is linearly extrapolated to the present day. The lin- 
ear density contrast at some redshift z is related to the extrapolated 
version through 8 L (z) = D(z)Sq, where D(z) is the growth fac- 
tor from linear perturbation theory normalised to unity at z — 
and Sq is the linear density contrast extrapolated to present day. In 
this paper, we will work almost exclusively in linearly extrapolated 
coordinates. For brevity, we will drop the subscript "0". Whenever 
it is necessary to consider quantities that have not been linearly ex- 
trapolated, we will note it in the text. 

We define a void within the excursion set formalism to be 
a region of scale R with smoothed density contrast 8 L (R) that 
has fallen below a potentially scale-dependent density contrast 
threshold 5%, henceforth referred to as the void barrier. This 
barrier is analogous to the critical overdensity S c used in the 
IPress & Schechtej ( fl97j) formalism. To proceed, we must there- 
fore specify the analogous threshold for voids. There are two sen- 
sible choices. The first option is to consider the physics of dark 
matter. Alternatively, one may rely upon observational parameters, 
as we will in 82.21 

In order to avoid counting smaller voids that are embedded 
within larger scale voids - the so-called void-in-void problem - 
we only consider the largest scale (smallest a 2 ) at which a given 
trajectory crosses 8% ■ Hence, the most important step in calculating 
the void mass function with excursion set techniques is to obtain the 
distribution of scales R at first crossing. 

There is, however, an important difference between the halo 
and void formalisms. In deriving the void mass function, we must 
be careful to exclude voids that are embedded in larger overdense 
regions that will eventually collapse into virialized objects. These 
voids will be crushed out of existence during the collapse of larger 
scale overde nsities (this is the so-called void -in-cloud problem). To 
address this. lSheth & van de Weygaertl d2004l) introduced a second 
absorbing barrier, henceforth referred to as the void-crushing bar- 
rier Sp . They argue that an appropriate choice for the void-crushing 
barrier lies between 5p = 1.06 — 1.69. We therefore seek the distri- 
bution of scales R at which trajectories first cross 6„ without hav- 
ing crossed 5p . The problem of calculating the void mass function 
is reduced to solving the diffusion equation in the (8 L ,a 2 ) plane 
with appropriate boundary conditions at two absorbing barriers. 



2.2 Denning voids in terms of the galaxy underdensity 

In this section, we obtain the linear underdensities defining vo ids in 
the excursion set formalism. ISheth & van de Weygaertl d2004l) use 
the scale independent underdensity corresponding to shell-crossing 
in the spherical expansion model (5^ = —2.81). Shell-crossing 
occurs when fast moving mass shells from the interior of voids run 
into initially larger and slow moving mass shells, forming an over- 
dense ridge. While it is certainly reasonable to define voids through 
shell crossing, analytic calculations utilising this criteria yield char- 
acteristic void sizes at the present (~ 3/i _1 Mpc) that are signifi- 
cantly smaller than observed (R ~ 15/i _1 Mpc). Moreover, a void 
formation criterion based on the evolution of dark matter underden- 
sities is difficult to reconcile with observational data. Of course, 
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Table 1. Comoving LAE Densities and Mass Thresholds at High Redshifts 



z 


n g (PS; 10- 4 Mpc" 3 ) 


n g (ST; 10- 4 Mpc" 3 ) 


m min (xl0 10 M Q ) 


4.86 


1.0 




32 


5.7 


5.4 




7.9 


6.5 


2.6 




6.5 


Fixed m min 


1 


61 


108 


1.0 


8 


19 


47 


1.0 


9 


5.4 


19 


1.0 


Fixed z 


10 


10,413 


14,709 


0.01 


10 


220 


495 


0.1 


10 


1.3 


6.8 


1.0 


10 


6.3 x 10~ 4 


1.6 x 10" 2 


10 



Where applicable, columns 2 and 3 show the number densities obtained with the Press-Schechter and Sheth-Tormen mass functions respectively. 



observational surveys ar e only sensitive to galaxy underdensities. 
Following this reasoning, Furlanetto & Piran (2006) extend the an- 
alytic model of lSheth & van de Weyga ert ( 2004) by utilising a void 
formation criteria that defines voids in terms of galaxy underdensi- 
ties. 

Consider a region of space with a linear density contrast 
5 L (R). The mass contained within the region is given by M = 
po(l + <5)(47r/3).R 3 , where po is the average matter density to- 
day and 8 is the true density contrast. We use the spherical ex- 
pansion model in order to relate the true density contrast in a re- 
gion to the local linear density contrast 8 L (pre-extrapolation). In 
this paper, we use a fit to the function 8 \8 L ~\ given by equation 
(18) of lMo& White] dl996l) . Although they assume an Einstein- 
de Sitter cosmology, the fit is an excellent approximation to the 
ACDM version at high redshifts (accurate to a few percent). The 
spherical model and fit both break down at 8 L = —2.81, cor- 
responding to shell-crossing, since mass is no longer conserved 
(the underdensities relevant to this work never reach such low val- 
ues). The function 8 [<5 L ] allows us to write the mass contained 
within a spherical region as M = po (4n/3)(R/ri) 3 , where 
V ^)=[l + S(S L )]- 1/3 . 

Using the halo model of structure formation, the comoving 
nu mber density of observabl e galaxies within the considered region 
is dFurlanetto & Piraij2006l) 



n g (m min \8 ,M) 



dm h (N(m h )) n h (m h \S L ,M) (1) 



where mi is the dark matter halo mass, {N(rrih)) is the average 
num ber of galaxies per halo, is the condi tional halo mass func- 
tion teond et alj|l99ll : lLacev & Colejll993h . and m min is a halo 
mass detection threshold. For the high redshifts of interest here, we 
assume that the average number of observable galaxies per halo is 
unity above m m i n and zero below it. 

Where possible, we will fix m m in by normalising the comov- 
ing galaxy density to observational data. Recently, there have been 
several claimed identifications of large-scale structure in LAE sur- 
veys. For redshifts of z = 4.86, 5.7, and 6.5, we will use the 
comov i ng number densit i es fro m the surveys of Shimasa ku et al.l 
d2004l) . IShimasaku etal] fcOOd) . and iKashikawa et al J d2006h re- 
spectively. Both of the first two surveys found evidence for large- 
scale structure at high redshifts. 



For z = 5.7, we integrate the Schechter function fit to the 
luminosity function obtained by Shimasak u et al.l J 2006h to L = 
3 x 10 42 ergs/s. Similarly, Kas hikawa et al. I d2006h provide upper 
and lower limits for the luminosity function at z — 6.5. We use the 
upper estimate with a fixed a = —1.5. Owing to the lack of ob- 
servational data at higher redshifts, we do not normalise the galaxy 
number densities to observational data. Instead, we simply spec- 
ify various halo mass thresholds to define a set of mythical high- 
redshift surveys. Table[T]shows the comoving number densities and 
halo mass thresholds obtained for redshifts between z = 4.86 and 
z = 10. 

For the conditional halo m ass function in equation (T]>, we 
use th e excursion set expression jBond et al.lll99l| - lLacev & Cold 
Il993l) 



n h (m h \S , M) 



Po 



diner 



dlnrrih 



a (M) : 



13/2 



x exp 



(8^ - 8 L f 
~2[a 2 -a(M) 2 ] 



(2) 



where 8^ is the critical density contrast at collapse, extrapolated 
to the present day. Although equation (|2j provides a reasonable 
approximation t o the halo abundance, it is c ertainly not the mos t 
accurate choice. ISheth & Torrnen] d 19991) and I Jenkins etal] d200lh 
obta in more accurate fits to the results of numerical simulations 
(see Cohn & Whitdl2007l for recent tests at high redshifts). Table 
Q] compares the mean number densities obtained from the different 
mass functions. Owing to a larger high-mass tail with respect to 
the Press-Schechter form, the Sheth-Tormen m ass function clearly 
results in larger number densities. However, as lFurlanetto & Piranl 
d2006l) point out, normalising the comoving galaxy number density 
to observed values significantly decreases the differences between 
mass functions. Our choice of analytic mass function in equation 
(|2j therefore suffices for the calculations in this paper. 

We can now use equation |Q} to write down a relationship 
between the observable mean galaxy underdensity 8 g in a region 
of size R and the lineari sed dark matter density contrast 8 L (R) 
dFurlanetto & Piranll2006l) : 



i _i_ x c-L r»\ n g (m min \8 L ,M) 
l+o g (m min ,d ,H) = = -, r — . 

■tf n g {m m in) 



(3) 
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Note that the factor of rj~ 3 is used to transform the numerator on 
the right hand side from Lagrangian to Eulerian coordinates. 

Computation of the void barrier requires a suitable choice for 
the galaxy underdensity defining a void. Ideally, 5 g would be cho- 
sen to most accurately reflect the void finding algorithm in the 
survey of interest. Unfortunately, the aforementioned LAE sur- 
veys are not large enough to allow a systematic search for voids. 
Hence, an appropriate ch oice for S g is not at all clear. Following 
iFurlanetto & Piranl £2006), we choose S g = —0.8 as the fiducial 
value for calculations in this paper; we will consider a modified 
definition in §3.2\ 

The prescription for defining voids in terms of the galaxy un- 
derdensity is now straightforward. We first set n g (m m i n ) equal to 
the mean galaxy density extracted from observational surveys and 
solve for the corresponding m m in- We then define a void to be a 
region with a given galaxy underdensity S g and solve equation {3} 
for the corresponding 5^(R) to b e used in the excursion set for- 
malism ( Furlan etto & Piran 2006). Several examples of such cal- 
culations are shown in Figure[T] Figure[T](a) shows void barriers at 
z — 4.86, 5.7 and 6.5. Panel (b) shows higher redshift void barriers 
at z — 7—10 for a fixed halo mass threshold of m m i„ — 10 10 Mq . 
Panel (c) shows the void barriers for several halo mass thresholds 
and a fixed redshift of z — 10. For reference, a 2 w 1.58,0.72, 
and 0.42 for R = 5, 10, and 15 Mpc respectively. Owing to the 
increased bias of galaxy haloes relative to the underlying matter 
density at high redshifts, the matter density contrast required to 
produce a mean galaxy underdensity of S a = —0.8 is smaller com - 
pared to the z = case (see Figure 2 in Furlanetto & Piran 2006). 

Figure[T]illustrates that for large R, voids must be underdense 
in dark matt er as one would expect. Ho wever, we note that in the 
formalism of IFurlanetto & Piranl i2006l) . small voids may actually 
correspond to regions that are overdense in dark matter (5% > 0). 
This is due to finite size effects. For both cases what is important is 
that voids are defined to be regions that are underdense in galaxies 
(S g < 0). We shall see that, for most scales of interest, voids in the 
galaxy distribution do in fact correspond to dark matter underden- 
sities. 
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2.3 Void size distributions 

Using the dark matter underdensities obtained in ^2.2! as void bar- 
riers, we are now in the position to calculate void size distributions 
within the excursion set formalism. Most of the void barriers shown 
in Figure [T] are well approximated as linear functions of a 2 (M). 
One approach is to solve a diffusion problem in the (S L , a 2 ) plane 
with one linear absorbing barrier - the void barrier - and one con- 
stant absorbing barrier - the void-crushing barrier. Owing to the 
non-trivial boundary conditions, obtaining an analytic solution for 
this problem is rather difficult. Numerical techniques for obtaining 
the first-crossing distribution with generic b oundary condit ions do 
exist (for an overview of such techniques, see lZentne J 2 007 ) . How- 
ever, in the interest of obtaining analytic solutions, we approximate 
the first-crossing distribution with a solution involving two linear 
absorbing barriers of the form 5% = A v +f3a 2 and 6i = A p +f3a 2 . 
We find that for A p = 1.06, 5p < 1.69 over the range of interest 
for the models considered in this paper. More importantly, we show 
in ^2.41 that the void-crushing barrier has little effect on the calcu- 
lated void size distributions anyway. 

A full derivation of the mass function in the case with two 
linear absorbing barriers with identical slopes can be found in 
Furlanetto & Pira^ fc006l) . We provide a brief summary here. In 
the following discussion, we set S = a 2 for simplicity. 



Figure 1. Linear underdensity thresholds (pre-extrapolation) denning voids 
in the excursion set formalism. All curves assume a galaxy underdensity 



of S a 



-0.8. (a): The solid, dashed, and dotted curves show void 



barriers at z = 4.86,5.7, and 6.5 respectively. Comoving galaxy num- 
ber de n sities are normal i sed to data in the surveys o f Shimasaku et al. 
120041) . IShimasaku etal] J2006l) . and iKashikawa etal] 120061) . (b): Void 
barriers at redshifts of z = 7, 8, 9, and 10 (solid, dashed, dotted, dot- 
dashed respectively). A fixed halo mass threshold of m m i n = 10 10 Mq 
is assumed, (c): The solid, dashed, dotted, and dot-dashed curves show 
void barriers at a fixed redshift of z = 10 for halo mass thresholds of 
m min = 10 s , 10 9 , 10 10 and 10 11 M respectively. 



The probability that a trajectory will cross the void barrier first 
(i.e. before the void-crushing barrier) at a scale between S and S + 
dS is given by dFurlanetto & Piran 2006) 



F v (S)dS = 



n tv D sin(n7rD) 



Az 



exp 



n=l 

x exp \ —/3A v 



n\ 2 D 2 
2A 2 /S 



-p 2 S/2\ dS. 



(4) 



Here, D = \A V \ /(A v + \A V \) and F V (S) is known as the first- 
crossing distribution. 

Following equation jH >, the desired mass function has the form 
(Furlanetto & Piran 2006) 
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cr 
> 



A 




100 



Figure 2. Void size distributions and volume filling fractions at redshifts 
of z = 4.86 (solid), 5.7 (dashed), and 6.5 (dotted). See Table[T]for model 
parameters. 



n v (M) = n^ CS (M)exp (-0A V - f3 2 a 2 /2) 
where the function n 2CB (M) is given by 



(5) 



nl CB (M) 



2 po 
M 2 

x exp 



din a 



E 



dlnM 



n=l 
2 2 t-,2 



n 2 n 2 D 2 sm(nnD) 
(A v /a) 2 nit 



2{A v /a) 



(6) 



Note that equation l(6j is the mass function obtained in the 
case with two constant barrie rs 8 V = A v and 8p = A p 
dSheth & van de Weygaertll2004l) . 

Converting equation {4]l to units of distance to obtain the frac- 
tion of voids per logarithmic interval in R yields VR n v (R), where 
the volume V and radius R are in comoving coordinates. Hence, the 
fraction of volume contained within voids of radius greater than R 
is given by 

F vo i{> R) -- 



Vn v (r) dr 



-^jf n v {m) dm. (7) 

M PO 

In FigureH we plot VR n v (R) and F vot (> R) for z = 4.86, 5.7, 
and 6.5 using the parameters given in Table Q] Typical voids in the 
LAE distribution at z ~ 5 are roughly 20 comoving Mpc across 



in our calculations. The z = 4 .86 voids are largest because the sur- 
vey of IShimasaku et alj {2004) has the highest detection threshold 
(see TableQJ. Similarly, Figure[3]shows VR n v (R) for z — 7, 8, 9 
and 10 for a variety of halo mass thresholds. Using equation Q 
and m m zn = 10 10 Mq, we find that approximately 21, 26, 31 and 
37 % of space is filled by voids with radii larger than 10 Mpc at 
z = 7,8, 9, and 10 respectively. Panel (6) in Figure [2] shows that 
F vo i{> R) approaches unity as R — > 0, indicating tha t the entire 
universe is filled by voids. As lFurlanetto & Piranl d2006h point out, 
this peculiarity is primarily due to the fact that we have included 
voids embedded within regions that are not quite at turnaround. 
These voids will likely be suppressed by the surrounding overden- 
sities. By allowing them to expand fully, we have overestimated the 
volume contained within voids. 

The peaks in Figures [2] and [3] occur where a ~ 8% . The large 
scale cutoffs are due to the smoothness of the matter density field 
at large scales. As R — > oo, o 2 (R) — > and the probability 
of crossing the void barrier approaches zero. On the other hand, 
the small-scale cutoffs are a result of th e rising void barrier. Note 
that th is di ffers from the low-z result s of lSheth & van de Weygaerq 
(2004) and lFurlanetto & Piranl J2006I) . in which the small-scale cut- 
offs are due to the void-crushing barrier. As we have seen in i]2.2| 
high-redshift galaxy voids are not as underdense in dark matter as 
their present day counterparts and the void barrier actually crosses 
through 8y = 0. Most trajectories are therefore absorbed by the 
void barrier before reaching larger a 2 (R), resulting in a suppres- 
sion of the mass function for small scales. Section l2"l4l of this paper 
examines the role of the void-crushing barrier in detail. 

Figure [4] depicts the evolution of the void size distributions 
with redshift for a fixed halo mass threshold of m m i„ = 10 10 M©. 
The solid, dashed, dotted, and dot-dashed curves correspond to 
z = 0, 6, 8, and 10 respectively. Panel (a) shows that the charac- 
teristic scales of voids are actually slightly larger at higher redshifts 
(decreasing by roughly 2 Mpc per Az — — 2 between z = 10 and 
z = 6) due to a decreasing galaxy number density and increasing 
biasB Interestingly, the void scales decrease by only 2 Mpc be- 
tween 2 = 6 and z = 0. This is due to the competition between 
a decreasing spatial bias of galaxies with respect to the underly- 
ing matter and the gravitational expansion of underdense regions. 
Neglecting gravitational effects, the increased abundance and de- 
creased spatial bias of galaxies at lower redshifts would decrease 
the characteristic scales of voids. However, as we approach the 
present day, underdensities evolve through gravitational expansion; 
voids become deeper and larger as mass is evacuated from the inte- 
rior. These two effects work against each other, creating less net 
change in the characteristic scale of voids between z = 6 and 

2 = 0. 



2.4 The void-crushing barrier 

In equation l[5j, we assume that both the void and void-crushing 
barrier are linear functions of a 2 with the same slope. In this section 
we test how our results depend on the particular choice of void- 
crushing barrier. In the following discussion, all calculations will 
be performed using our fiducial void underdensity of 8 3 = —0.8 
and at a redshift of z = 5.7. See TableQ]for the model parameters. 



We found an error in the z-dependent mass function used by 
iFurlane tto & Piran 12006). As a result, their Figure 7 incorrectly indicates 
that galaxy void sizes decrease with redshift. 
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Figure 3. (a) — (d): Void size distributions at redshifts of z = 7 — 10 for several different halo mass thresholds. Solid, dashed, dotted, and dot-dashed curves 
assume m min = 10 s , 10 9 , 10 10 and 10 11 M respectively. 



First, recall that we have used a linear function for the void- 
crushing barrier, introducing some scale dependence to the void- 
crushing with little physical basis. Fortunately, the relevant range of 
a 2 is small enough at high redshift that this makes little difference: 
with A p = 1.06, we have S p = 1.15 at R = 10 Mpc for z = 5.7. 

We have previously used A p — 1.06, corresponding to the 
linear density contrast at turnaround in the spherical model. Our 
first task is to vary this value. The thin solid, dashed, dotted, and 
dot-dashed curves in Figure [5] show the void size distribution at 
z = 5.7 for A p = 0.3, 0.5, 1.06, and 1.69 respectively. Note that 
the curves with A v = 1.06 and 1.69 are identical and lie within the 
thick solid curve. 

Figure [5] shows that the particular choice of A v has a minor 
effect on the void size distribution at most. If A v is comparable 
to \A V \, a small number of trajectories will encounter the void- 
crushing barrier before the void barrier. These trajectories are sub- 
tracted from n v (R) since they represent voids that will eventually 
be crushed out of existence. In Figure|5] this suppression is visible 
at small R for A p = 0.3 and A p = 0.5. We note that no suppres- 
sion is seen in the cases where A v = 1.06 and A v — 1.69. We will 
see in ^3.1 1 that the issue of small-scale suppression is irrelevant 
anyway because of stochastic fluctuations in the galaxy distribu- 
tion. 

On the other hand, if A p \A V \, then the probability that a 
trajectory will encounter the void-crushing barrier first at small R 
is negligible. In this case, the small scale cut-off of the void size 
distribution is not due to the void-crushing barrier. As R decreases, 



5 V crosses zero, trapping most trajectories where 6% ~ 0. Hence, 
the void barrier itself absorbs most trajectories before they reach 
small R. 

Following the argument above we would expect that the void 
size distribution becomes independent of the void-crushing barrier 
as A p gets larger. To illustrate that this is indeed the case, it is in- 
structive to consider the diffusion problem with only one linear ab- 
sorbing bar rier of the fo rm 8^ — A v + PS. The appropriate mass 
function is dShethll 19981) 



nl LB (M) 



2 pa \ A V \ 
ty M 2 a 



d In a 



dlnM 



exp 



[pa 2 + A v f 



2a 2 



•(8) 



We plot the void size distribution obtained with equation ([8) as the 
thick solid curve in Figure [5] It is indeed virtually identical to the 



cases where A % 



1.06 and 1.69. 



The weak dependence of our results on the void-crushing bar- 
rier also allows us to obtain an approximate analytic expression for 
the fraction of mass contained within voids with masses greater 
than M, In what follows, we neglect the void-crushing barrier en- 
tirely and assume a single linear void barrier wi t h A v < and 
P > 0. Following the appendix of Mc Ouinn et al.l 1 120051) . the frac- 
tion of trajectories that cross the void-barrier between scales of S 
and 5" + dS is 



-4Qw dS. 



(9) 
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Figure 4. Void size distributions and volume filling fractions at z = 
(solid), 6 (dashed), 8 (dotted), and 10 (dot-dashed) for a fixed m m i n = 
10 10 Mq . Voids are larger at high redshifts due to a decreased galaxy num- 
ber density and increased bias with respect to the underlying dark matter. 



where 

Qit(y,S) 



exp 
1 



V 



2/3 2 S 
: exp 



exp 



(y - 2A V (3) 2 



2/32 5 



(5 2 S 
2 



(10) 



Note th at our limits of integration differ from those in equation 
(Cll) of McOui mTet alJd2005h since A„ < 0. Integrating equation 
© from to S yields 



F(> M) = 1 - -erfc 



x 1 + erf 



A v + f3a 2 (M) 

V2a(M) 
A v -f3a 2 (M) 
V2a(M) 



+ 



exp(-2A„/3) 



(11) 



In most cases of interest, equation i ll It quite accurately approx- 
imates the fraction of mass contained within voids with mass 
m > M at high redshifts (including the void-crushing barrier). 
Note, however, that the fraction of space containing voids larger 
than a given radius is not as straightforward to compute, because 
the volume conversion factor 77 is a function of the void mass. 



> 




100 



Figure 5. Void size distributions at 2 = 5.7 for different void-crushing 
barriers of the form & p = A p + f3S. The thin solid, dashed, dotted, and 
dot-dashed curves assume A p = 0.3, 0.5, 1.06 and 1.69 respectively. The 
thick solid curve shows the void size distribution obtained by neglecting the 
void-crushing barrier entirely. Note that the dotted and dot-dashed curves 
are obscured by the thick curve, indicating that the size distribution is es- 
sentially independent of the void-crushing barrier for reasonable choices of 

Art. 



3 STOCHASTIC FLUCTUATIONS IN THE GALAXY 
DISTRIBUTION 

3.1 Stochastic voids 

The small galaxy densities in TableQ]refiect the fact that observable 
galaxies are increasingly sparse at high redshifts. Their random 
fluctuations will form large empty regions, henceforth referred to 
as "stochastic voids". These are inherently different from the voids 
we model with the excursion set approach. They do not form grav- 
itationally and do not yield any useful information on large-scale 
structure. For the range of galaxy densities we consider, they are 
a major source of noise that could potentially obscure meaningful 
measurements in high-redshift surveys. In this section, we estimate 
the sizes of typical stochastic voids in order to determine the likeli- 
hood that they will be misidentified as real voids. 

Our first task is to define a stochastic void properly. Sup- 
pose that galaxies were truly randomly distributed, obeying Pois- 
son statistics. In that case, the probability that a region of comov- 
ing volume V and mean galaxy number density n 9 will contain 
zero galaxies is given by the well known formula 



Po(V) = exp [-n g V] . 



(12) 



Equation H21 does not account for the fact that an empty re- 
gion may lie inside of a larger empty region. In order to avoid over- 
counting smaller stochastic voids (much like the void-in-void prob- 
lem), we define a stochastic void as the largest sphere that will fit 
inside of an empty region in a random distribution of galaxies. The 
probability that a stochastic void will have a radius between R and 
R + dR is the probability that a sphere of radius R is empty mul- 
tiplied by the probability of encountering at least one galaxy when 
the radius of the sphere is enlarged by dR. The latter is simply 
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Figure 6. (a) — (6): Size distributions and volume filling fractions of true 
(dashed) and stochastic (solid) voids for z = 5.7 (thin) and z = 10 (thick). 
At higher redshifts, stochastic voids are typically the same scale as true 
voids, making the identification of large-scale structure difficult. 



n g 4:-KR 2 dR. Thus, the probability that a stochastic void will have 
a radius between R and R + dR is given by 

dP{R) = n g ATvR 2 exp [-n 9 (47r/3)P 3 ] dR. (13) 

To obtain a quantity that is directly comparable to our previ- 
ous calculations, we consider the stochastic void probability per 
logarithmic interval in R. Figure [6] (a) shows dP(R)/d\n R at 
z — 5.7 and z = 10. The fraction of space filled by stochastic 
voids larger than R, obtained by integrating equation J 1 3 1 > from 
R to oo, is shown in panel (b). For both plots, the solid curves 
correspond to stochastic voids while the dashed curves represent 
true voids. The thin and thick curves correspond to z = 5.7 and 
2 = 10 respectively. At z — 10 we assume a halo mass threshold 
of m min = 10 10 M Q . 

Panel (a) shows that random fluctuations in the galaxy distri- 
bution are slightly smaller than true voids at 2 = 5.7, making the 
identification of real voids with radii below the characteristic size 
difficult. However, the stochastic void distribution displays a sharp 
cutoff at R ~ 10 Mpc due to exponential suppression. Thus, in or- 
der to minimize the contamination of void samples with stochastic 
voids, it is necessary to seek real voids with scales of R > 10 Mpc. 
This is, of course, a model-dependent statement: if we imposed a 
less rigorous definition for "true" voids (allowing them to be, say, 



only 50% underdense), they would become larger and more easily 
differentiable from stochastic voids. 

As we discuss in £|2.2| we assume a lower luminosity limit of 
3 x 10 42 ergs/s when calculating the real void size distribution for 
2 = 5.7. The luminosity limit w as chosen to be consist ent with 
the detection threshold reported by Shimasa ku et al.l ( 120061) . Lower 
detection thresholds reduce the characteristic scales of stochastic 
voids more than real voids. While real voids sizes decrease by a 
maximum of ~ 30% for a lower luminosity limit of 3.75 x 10 41 
ergs/s, stochastic void sizes decrease by ~ 50%. Surveys with 
lower detection thresholds are therefore much better suited to iden- 
tify real voids. 

The situation at higher redshifts depends on the halo mass de- 
tection threshold. At redshifts of z = 7, 8, 9, and 10, the char- 
acteristic scales of stochastic voids are roughly equal to those of 
real voids for m min = 6.0 x 10 10 , 3.0 x 10 10 , 1.5 x 10 10 , and 
8.2 x 10 9 Mq respectively. For m m i n larger than these values, 
stochastic voids have larger characteristic scales than real voids 
and vice versa. Thus, future surveys must obtain increasingly lower 
detection thresholds in order to obtain useful information on void 
properties at 2 = 7 — 10. 

For completeness, we note that the Sheth-Tormen mass func- 
tion yields smaller stochastic void sizes due to the increased mean 
number density at higher redshifts (see Table QJ. The results pre- 
sented in Figure [6] are also quite sensitive to the choice of as be- 
cause galaxy densities are extremely sensitive to this parameter at 
these redshifts. 

3.2 Empty voids 

The calculations above suggest that a significant portion of the 
high-redshift sky should consist of voids - both true and stochastic. 
Although we have defined the latter to be completely empty regions 
(in contrast to true voids, which just have small overall densities), 
in practice high-redshift galaxies are so rare that the two will be 
difficult to distinguish. For example, we expect only 0.45 and 0. 1 1 
galaxies inside each void with a diameter of 20 Mpc at z = 5.7 
and 2 = 10 (assuming m m in = 10 10 Mq for the latter). The simi- 
lar appearances of both kinds of voids, together with the relatively 
small observational samples so far obtained, suggests a modified 
definition of the void barrier that accounts for the stochastic nature 
of the galaxy number counts inside of voids. 

Consider an ensemble of underdense regions with comov- 
ing volum e V and a mean comoving galaxy density of n g — 
n g ( 1 + 5 g ) . Casa s-Miranda et ail ( 2002 ) found that the galaxy num- 
ber variance in underdense regions at the present day is very close 
to the Poissonian value. We therefore assume that the galaxy num- 
ber is Poisson distributed about the mean value n g V. The proba- 
bility that an underdense region contains zero galaxies is given by 
equation dl2t with n g — > n v g . Writing Po(V) in terms of the mean 
galaxy underdensity S g and the total mean galaxy density n g yields 

Po(V) = exp [-n g (l + S g )V] . (14) 

Equation d 14b suggests a simple way to define voids in terms 
of the probability that a region will be completely empty. In our new 
definition of voids, we fix the value of 8 g such that the probability 
Po that a region of space will be empty is constant for all scales. 
Inverting equation dl4t to find S g in terms of R and Po yields 

^i^" 1 - (15) 
For a fixed Pq, we plug equation dl5t into equation ((3} to obtain 
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Figure 7. Void barriers (pre-extrapolation) at z = 5.7 (thin) and z = 10 
(thick) assuming that 50% (solid) of voids at all scales are observed to be 
completely empty. The dashed lines correspond to the fiducial model with 
8 9 = -0.8. 

the dark matter density contrast 5^ required to produce an average 
galaxy underdensity of S g (R, Po). We then fit 5% to a linear func- 
tion of a 2 such that 5^ is well approximated in the R « 5—20 Mpc 
regime. 

The void distributions we obtain using equation J 1 5t actu- 
ally contain both stochastic and real voids. The former and latter 
dominate the distribution at small and large R respectively. The 
transition between the two regimes occurs at the scale for which 
S g {R) ~ 0, or Ro ~ [31n(l/P ) / 4tto 9 ] 1/3 . For R > Ro, 5 g 
quickly approaches —1. Voids with scales such that S g (R) < —0.8 
represent real galaxy voids as we have defined them in §2.2\ For 
R < Ro, the mean galaxy underdensity is greater than zero. Al- 
though regions with this scale have a probability Po of being empty, 
they are, on average, overdense in both galaxies and dark matter. 
Hence, the voids obtained in our new model with scales R < Ro 
are statistical fluctuations that result from finite size effects. We 
note that equation J 1 5t is not a perfect definition of empty voids 
since a fraction 1 — Po of voids will contain galaxies. Nonetheless, 
it is the closest we can come to defining voids as empty regions in 
the excursion set formalism. 

The thin and thick solid curves in Figure[7]show the void bar- 
riers at 2 = 5.7 and z = 10 respectively in our new definition. We 
use the parameters from Table[T]and an empty fraction of Po = 0.5. 
The dashed curves in Figure [7] show the fiducial model void barri- 
ers. 

The solid and dashed curves in Figure[8](a) show the void dis- 
tributions derived from the new and fiducial void barriers at z = 5.7 
respectively. The dotted curve shows the stochastic void distribu- 
tion obtained in i ]3.1l For reference, we show the mean galaxy 
underdensity required by equation d 1 5t in panel (b). As Figure [8] 
shows, the void distribution obtained with equation J 1 5 b has a char- 
acteristic scale that is slightly larger than the stochastic distribution. 
Both panel (6) and the dotted curve in panel (a) illustrate that the 
newly obtained distribution is dominated by stochastic fluctuations 
for R < 7 Mpc. The newly calculated void distribution is also more 
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Figure 8. (a): Void size distributions at z = 5.7. The solid, dashed, and 
dotted curves correspond to modified, fiducial, and stochastic void distribu- 
tions respectively, (b) : The average galaxy underdensity such that voids of 
all scales have a 50% probability of being empty. 

El 

sharply peaked than the fiducial curve. The steeper large scale cut- 
off is due to the fact that, in our new definition, larger voids must 
be more underdense in order to have a 50 % probability of being 
empty. Such underdense regions are increasingly rare at large R. 
We emphasize, however, that larger voids do exist as predicted by 
the fiducial model; they just do not satisfy our new criteria. 

The purpose of this section has been to show that stochas- 
tic voids present a significant contaminant at high redshifts. The 
expected sizes of deep large-scale voids (at least 80% underdense 
in galaxies) are always fairly close to the sizes of empty regions. 
Searches for large-scale structure must therefore either (1) con- 
fine themselves to extremely large scales 10 Mpc) and mod- 
est underdensities or (2) reach sufficient depth to detect small halos 
(~ 10 9 Mq) . The latter may in fact be possible if the lensed sources 
observed by IStark et all d2007l) prove to be at z = 9, sugg esting 
number densities ~ 0.1 Mpc -3 jMesinger & Fur lanetto 20q3). 



4 LAES AND THE FULL GALAXY POPULATION 

In sections 12.21 - 12.31 we have relied upon high-redshift LAE sur- 
veys to obtain the comoving galaxy number densities for redshifts 
of z = 4.86, 5.7, and 6.5. Thus far we have assumed that all galax- 
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Figure 9. Void size distributions in LAE surveys at z = 5.7 assuming that 
only a fraction of galaxies are sampled. The solid, dashed, dotted, and dot- 
dashed models assume that 100 %, 70%, 30%, and 10% of galaxies are 
LAEs respectively. The characteristic scales of true voids decrease if only 
a fraction of galaxies are detected, increasing the contamination of void 
samples by stochastic fluctuations. 



ies within haloes with masses greater than the threshold m m i„ are 
observable in these surveys. However, lower redshift studies sug- 
gest that only a fraction of galaxies have strong Ly a emission lines. 
For example. iGawiser et alj ( 120070 estimate that only ~ 1 — 10% 
of haloes with masses above ~ 10 10 ' 6 Mq are occupied by LAEs 
at z = 3.1. It is therefore reasonable to suspect that many high- 
redshift galaxies go undetected in LAE surveys. In this section, we 
consider what happens to the void size distributions when we as- 
sume that only a fraction of galaxies are sampled in LAE surveys. 
We will assume throughout that the processes determining whether 
or not a particular halo hosts an LAE are internal to the galaxy itself 
and so are independent of its environment. 

In what follows, we assume that a randomly chosen fraction 
of galaxies are LAEs. Mathematically, this decreases the mean ob- 
servable galaxy density derived from LAE surveys by a factor f v i s , 
from now on referred to as the visible fraction. The observable 
mean galaxy density is given by 



o / 

in 



) — fvis ?lg ['WI'min ) •, 



(16) 



where n g is the intrinsic mean galaxy density and the superscript 
"o" denotes observable. The mean galaxy density within an under- 
dense region n g (in min \6 L , M) decreases by the same factor f vis . 



The procedure for calculating void size distributions is only 
slightly modified from the usual case, because the halo mass 
threshold is now a function of f V i 3 . For a fixed observable mean 
galaxy density and visible fraction, equation d I fib is solved for 
(fvis)- As before, we define voids in terms of the observable 
mean galaxy underdensity through equation ((3). 

Figure|9](ci) shows the void size distributions at z — 5.7 for 
a variety of visible fractions in the fiducial model. Panel (b) shows 
the modified model of H3.2\ Both panels in Figure|9]show that the 
visible fraction has only a small effect on the void size distributions. 
Even under the assumption that 10% of LAEs emit, there is only a 
~ 30% difference in characteristic scales with the fiducial curves, 
and even less with the modified model. The visible fraction does 
however have an important effect on the role of stochastic voids. 
Since we have held the observable galaxy number density fixed, 
the corresponding stochastic void distributions are unaffected by 
f v i s . Therefore, as f V i a decreases and the characteristic scales of 
real voids gets smaller, void samples are increasingly contaminated 
by stochastic fluctuations. 



5 COMPARISON TO OBSERVATIONS 

Owing to a lack of statistics, a rigorous comparison of our cal- 
culations to observational surveys is not possible. In this section, 
we content ourselves w ith a rough comparison to the surveys of 
IShimasaku et alj j2006t) and lOuchi etail d2005l) . We focus on z = 

5.7 due to a number of recent claims of large-scale structure at this 
reds hift. 

IShimasaku et al.1 d2006h report evidence for the existence of 
large scale structure at z = 5.7 in their photometric sample of 89 
LAEs, including 34 spectroscopically confirmed objects. Their ar- 
gument is based on a roughly 20 % overdensity and underdensity 
in the western and eastern halves of their sky distribution respec- 
tively. Their survey covers a continuous area of 725 arcmin 2 and 
redshifts of z ~ 5.65 — 5.75, corresponding to a survey volume of 

1.8 x 10 5 Mpc 3 . We find a volume filling fraction F(> R) = 0.76 
for voids with at least half of their survey volume and S g — —0.2. 
The large volume filling fraction suggests that our result is con- 
sistent with the possibi lity that the observed underdense region in 



IShimasaku et al 



Ouchietal 



(2006) is a void progenitor. 



1 2005) report much more well-defined large-scale 
structure. Their catalog of 515 LAEs at z = 5.7, which covers 
an area of 180 Mpc x 180 Mpc x 48 Mpc, exhibits a high de- 
gree of clustering. They find clearly defined voids and filamen- 
tary features. Moreover, the voids depicted in their survey are ex- 
tremely large, ranging in size from 10-40 comoving Mpc in 
scale. Taking the scales of stochastic fluctuations and survey depth 
into account, these scales roughly correspond to void volumes of 
1.5 — 6.0 x 10 4 Mpc 3 , where we have approximated them to be 
cylindrical regions with lengths of 48 Mpc. A direct comparison of 
these voids to our predictions is problematic since our model as- 
sumes a spherical geometry. The best we can do is compare void 
volumes. The modified distribution shown in Figure|9]indicates that 
the voids of interest have a radii ranging from R ~ 10—15 Mpc, or 
comoving volumes of 0.42 - 1.4 x 10 4 Mpc 3 . Although our mod- 
ified definition yields volumes that are slightly smaller than the ob- 
served voids, the fiducial model in Figure [2] predicts the existence 
of a small number of larger scale voids with comoving diameters 
and volumes as high as 50 Mpc and 6.5 x 10 4 Mpc 3 respectively. 
Thus, we do not consider the large voids observed bv lOuchi et al.1 
(2005) to be in contradiction with our results. Note as well that this 
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model u nder-predicts the sizes of z — voids by a comparable 
amount dFurlanetto & Pirarj|2006h : the discrepancy may be due to 
redshift space distortions, the non-spherical regions relevant to this 
narrow-band survey, or our simplified void identification algorithm. 

Interestingl y, the widest an d most recent LAE survey con- 
ducted by Mura vama et alj J2007I) does not find conv incing evi- 
dence for the clustering observed bv lOuchietai1 l2005l Their sur- 
vey consists of 119 LAE candidates in a 1.95 deg 2 area, corre- 
sponding to a number density of 6.6 x 1 0~ 5 Mpc 3 . With such a 
small sky density (eight times smaller than Ouchi et al. 2003), true 
voids are masked by stochastic fluctuations (which have character- 
istic scales ~ 15 Mpc). 

Finally, we emphasize the difficulty in drawing conclusions 
from comparisons to sky distribution maps. Given the poor statis- 
tics, it is often difficult to determine conclusively whether a given 
empty region is a real void, and without redshifts we must com- 
pare our (spherical) predicted voids to cylindrical survey volumes. 
Furthermore, the samples we have described here are not com- 
pletely spectroscopically confirmed and probably contain a rea- 
sonable fraction of low-redshift contaminants. When voids are de- 
fined based on only a few galaxies, such contamination can signif- 
icantly affect the statistics (and, because the contaminants are also 
line-emitting galaxies at discrete redshifts, can introduce their own 
large-scale structure). At the very least, they affect the mass thresh- 
old of the survey (although probably not as much as uncertainty in 
f V i s ). Detailed comparisons will require simulations of the effects 
of these contaminants. The best we can say now is that there is 
no inconsistency with our model. Future surveys will undoubtedly 
allow for a more systematic comparison. 



6 DISCUSSION 

We have calculated void size distributi ons at z = 4.86-10 us- 
ing the exc ursion set model developed bv lSheth & van de Weygaertl 
(2004) and lFurlanetto & Piranl J2006I) . The latter found character- 
istic void radii of R ~ 7 — 14 Mpc at z = 0. For the observational 
sensitivities assumed in this paper, we obtained characteristic void 
radii that are very similar: R w 7 — 10 Mpc for redshifts between 
z = 4.86 and z = 10. These results are virtually independent 
of the void-crushing barrier (for any reasonable choice). We have 
shown that characteristic void scales actually increase with redshift 
for a fixed halo mass threshold due to a decreased number density 
and increased bias with respect to the underlying matter density. 
Following recent studies on the abundances of low-redshift LAEs, 
we explored the possibility that only a fraction /„, s of galaxies are 
sampled in LAE surveys. This has only a small effect on the void 
size distribution but increases the contamination of void samples 
by stochastic fluctuations. 

In section [3] we have explored stochastic fluctuations in the 
galaxy distribution. These fluctuations, although inherently differ- 
ent from the "real" voids we model in this paper, will result in large 
empty regions in the sky. Stochastic voids can therefore contam- 
inate real void samples and lead to erroneous conclusions on the 
formation of large-scale structure. We have estimated the typical 
scale of these regions to be slightly smaller than the characteristic 
scale of true voids at z ~ 5. At z ~ 10, the situation depends on the 
particular choice of m m i n . For m m i„ ~ 10 10 Mq , stochastic voids 
are typically the same scale as real voids. The increased importance 
of stochastic fluctuations will make the identification of large-scale 
structure at this redshift difficult. Attempts to do so must observe 



halos near the minimum mass to form stars, ~ 10 8 -10 9 Mq, in 
order for true voids to dominate the observed distribution. 

We found that a large fraction of real voids in our fiducial 
model contain no visible galaxies, adding to the difficulties in dif- 
ferentiating them from stochastic fluctuations. We have presented 
a modified definition of voids that incorporates both stochastic and 
real voids and so is easier to compare to the limited observational 
samples thus far available. In our new approach, we defined voids 
in terms of the probability for a region to be empty. We found that 
the modified void distributions are more sharply peaked and have 
characteristic scales that are comparable to the fiducial model. 

We have also attempted to visually compare our results to the 
most recent narrow-band filter surveys at 2 = 5.7. While we found 
no inconsistencies, it is difficult to draw any decisive conclusions 
because of small-number statistics, projection effects, and lower- 
redshift contaminants. Obviously, a more systematic approach is 
required. Future surveys promise to provide better statistics and in- 
creased sample volumes for studies on high-redshift voids. 

In the context of next generation surveys for high-redshift 
galaxies, our model is useful for gauging the impact of cosmic 
variance. Consider a fictitious survey at z = 10 with a detection 
threshold of m m i n — 10 10 Mq. Figure[6]illustrates that stochastic 
voids with R ~ 10 — 20 Mpc will dominate the sky distribution. 
Therefore, one must either search for voids with R > 20 Mpc or 
search deeper for significantly sm aller source s . The latter may be 
possible if the sources observed bv lStark et alj J2007I) are indeed at 
z ~ 9, in which case they im ply that halos near < 10 9 Mq are visi- 
ble dMesinger & Furlanettol2 007). However, high-redshift galaxies 
are so highly biased that even with deep observations, a substan- 
tial fraction of the Universe is filled with empty or nearly-empty 
regions. For example, at z = 10 and m m i n — 10 10 Mq, ~ 37% 
of space is filled by regions that are at least 80% underdense in 
galaxies and at least 20 Mpc across - or fully 7 arcmin. With the 
small fields of view available to near-infrared detectors, this sug- 
gests that either many independent fields must be observed or a 
large contiguous volume surveyed to be guaranteed of detecting a 
reasonable number of sources. 

Finally, we have neglected reionization and its effect on the 
appearance of large-scale structure. Regions of neutral hydro- 
gen are expected to modulate the LAE d ensity on large scales 
and accentuate the appearance of s t ructure ( Furlanetto et alj|2004l . 
l2006t lMcOuinnetalTl2006l 120071 : iMesinger & Furlanetto! l2007n . 
Although the precise time frame is currently unknown, quasar ob- 
servations and cosmic microwave background measurements have 
provided so me evidence that reionization occurred between z ~ 
6 - 10 (e.g lFan et al.ll2006l:|Page et al]|2007l; IMesinger & Haimar] 
l2004ll2007lb . Interestingly. iKashikawa et al.1 J2006I) found a signif- 
icant high-luminosity suppression in the LAE luminosity function 
between z = 5.7 and z — 6.5. Whether or not reioinization is re- 
sponsible for thi s effect is curr ently unclear (no such suppression 
was observed bv lDawson et alj|2007l) . 

Because IGM absorption modulates the LAE density on large 
scales, we would expect reionization to have a substantial effect 
on the observed void sizes in such narrow-band surveys (it should 
not affect galaxies identified through broadband effects). Of course, 
the plots in Figure[3]provide analytic estimates only of the intrinsic 
void size distributions. They provide a basis for comparison with 
high-redshift surveys in order to determine whether the observed 
features are easily attributable to the large-scale clustering alone. It 
therefore helps illuminate efforts to use voids to co nstrain the IGM 
proper ties during reionization, as first attempted bv lMcOuinn et al.1 
(120071) . 
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